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two, or three b-quark jets. The study is performed using a sample of proton-proton 
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with the missing transverse energy distribution as the principal search variable. The 
integrated luminosity of the sample is 4.98 fb _1 . The observed number of events is 
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production of supersymmetric particles decaying to b-quark-enriched final states in 
the context of simplified model spectra. 
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1 Introduction 

Many extensions of the standard model (SM) predict that events in high-energy proton-proton 
collisions can contain large missing transverse energy (E™ 1SS ) and multiple, high- transverse mo- 
mentum (px) jets. For example, in R-parity-conserving [1 j models of supersymmetry (SUSY) J2|, 
SUSY particles are created in pairs. Each member of the pair initiates a decay chain that 
terminates with the lightest SUSY particle (LSP) and SM particles. If the LSP only interacts 
weakly, as in the case of a dark-matter candidate, it escapes detection, potentially yielding sig- 
nificant E™ ss . Furthermore, in some scenarios 0, the SUSY partners of the bottom and top 
quarks can be relatively light, leading to the enhanced production of events with bottom-quark 
jets (b jets). Events of this type, with b jets and large E™ 1SS , represent a distinctive topological 
signature that is the subject of a search described in this paper. 

We present a search for new physics (NP) in events with large E™ 1SS , no isolated leptons, three 
or more high-px jets, and at least one, two, or three b jets. The analysis is based on a sample of 
proton-proton collision data collected at = 7 TeV with the Compact Muon Solenoid (CMS) 
detector at the CERN Large Hadron Collider (LHC) in 2011, corresponding to an integrated 
luminosity of 4.98 fb _1 . Recent searches for NP in a similar final state are presented in Refs. (H- 
7J. Our analysis is characterized by a strong reliance on techniques that use control samples in 
data to evaluate the SM background. The study is performed using the E™ 1SS distribution as the 
principal search variable. 

The principal sources of the SM background are events with top quarks, comprising tt pair and 
single-top-quark events, events with a W or Z boson accompanied by jets, and multijet events 
described by quantum chromodynamics (QCD). Diboson (WW, ZZ, or WZ) events represent 
a smaller source of background. For events with a W boson or a top quark, significant E™ 1SS 
can arise if a W boson decays into a charged lepton and a neutrino. The neutrino provides a 
source of genuine E™ 1SS . Similarly, significant E™ 1SS can arise in events with a Z boson if the 
Z boson decays to two neutrinos. For QCD events, significant E™ 1SS arises primarily from the 
mismeasurement of jet pj. A smaller component of the QCD background arises from events 
with semileptonic decays of b and c quarks. 

We interpret our results in the context of simplified model spectra (SMS) ll8T[TTl|. which pro- 
vide a general framework to characterize NP signatures. They include only a few NP particles 
and focus on generic topologies. We consider the SMS scenarios denoted Tlbbbb and Tltttt. 
Event diagrams are shown in Fig. [I] These two models are characterized by b-jet-enriched fi- 
nal states, large jet multiplicities, and large E™ ISS values, making our analysis sensitive to their 
production. For convenience, we express SMS phenomenology using SUSY nomenclature. In 
Tlbbbb (Tltttt), pair-produced gluinos g each decay into two b-quark jets (t-quark jets) and the 
LSP, taken to be the lightest neutralino %°. The LSP is assumed to escape detection, leading to 
significant E™ 1SS . If the SUSY partner of the bottom quark (top quark) is much lighter than any 
other squark, with the gluino yet lighter, gluino decays are expected to be dominated by the 
three-body process shown in Fig. [l|a) [Fig. [TJb)]. 

As benchmark NP scenarios, we choose the Tlbbbb and Tltttt models with gluino mass nig = 
925 GeV and LSP mass m^c,j> = 100 GeV, with normalization to the next-to-leading order (NLO) 
plus next-to-leading-logarithm (NLL) cross section [T2| . These two benchmark models lie near 
the boundary of our expected sensitivity. 

In Sections 2p we describe the detector and event selection. Section |4j introduces the A(^ m i n 
variable, used in the evaluation of the QCD background. Our techniques to evaluate the SM 
background from control samples in data are presented in Section [5] In Section [6] we describe 



2 



3 Event selection 



our analysis framework, based on a likelihood method that simultaneously determines the SM 
background and tests the consistency of NP models with the data, taking into account possible 
NP contamination of control sample regions. The interpretation of our results is presented in 
Section [7[ Section [8] contains a summary. 



b t 




Figure 1: Event diagrams for the (a) Tlbbbb and (b) Tltttt simplified models. 



2 Detector and trigger 

A detailed description of the CMS detector is given elsewhere [13}. The CMS coordinate sys- 
tem is defined with the origin at the center of the detector and the z axis along the direction of 
the counterclockwise beam. The transverse plane is perpendicular to the beam axis, with (p the 
azimuthal angle (measured in radians), 9 the polar angle, and rj = — ln[tan(0/2)] the pseudo- 
rapidity. A superconducting solenoid provides an axial magnetic field of 3.8 T. Within the field 
volume are a silicon pixel and strip tracker, a crystal electromagnetic calorimeter, and a brass- 
scintillator hadron calorimeter. Muons are detected with gas-ionization chambers embedded 
in the steel flux-return yoke outside the solenoid. The tracker covers the region \rj\ < 2.5 and 
the calorimeters \rj\ < 3.0. The region 3 < < 5 is instrumented with a forward calorimeter. 
The near-hermeticity of the detector permits accurate measurements of energy balance in the 
transverse plane. 

The principal trigger used for the analysis selects events based on the quantities Hj and H™ 1SS , 
where Hj is the scalar sum of the transverse energy of jets and H™ 1SS the modulus of the cor- 
responding vector sum. Due to increasing beam collision rates, trigger conditions varied over 
the period of data collection. The most stringent trigger requirements were Hj > 350 GeV and 
H™ 1SS > 110 GeV. The efficiency of the Hj component for the final event selection is measured 
from data to be 86% (99%) for H T values of 400 GeV (500 GeV). The efficiency of the H^ iss com- 
ponent is 98% for E™ 1SS > 250 GeV. Appropriate corrections are applied to account for trigger 
inefficiencies and uncertainties in the various control and search regions of the analysis. 

3 Event selection 

Physics objects are defined using the particle flow (PF) method [14], which is used to recon- 
struct and identify charged and neutral hadrons, electrons (with associated bremsstrahlung 
photons), muons, tau leptons, and photons, using an optimized combination of information 
from CMS subdetectors. The PF objects serve as input for jet reconstruction, based on the anti- 
kj algorithm [15j with distance parameter 0.5. Jet corrections [fl6| are applied to account for 
residual effects of non-uniform detector response in both px and tj. The missing transverse 
energy E™ 1SS is defined as the modulus of the vector sum of the transverse momenta of all PF 
objects. The E™ ss vector is the negative of the same vector sum. 
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Table 1: The definition of the signal (SIG) regions. The minimum requirements on Hj, £ miss , 
and the number of tagged b jets N\,j ets are given. 



Signal region 


Hj [GeV] 


£ miss [GeV] 


Nbjets 


lb-loose 


1BL 


> 400 


> 250 


> 1 


lb-tight 


1BT 


> 500 


> 500 


> 1 


2b-loose 


2BL 


> 400 


> 250 


> 2 


2b-tight 


2BT 


> 600 


> 300 


> 2 


3b 


3B 


> 400 


> 250 


> 3 



The basic event selection criteria are as follows: 

• at least one well-defined primary event vertex [17]; 

• at least three jets with p T > 50 GeV and \rj \ < 2.4; 

• a lepton veto defined by requiring that there be no identified, isolated electron or 
muon candidate |Tl8l [19| with pj > 10 GeV; electron candidates are restricted to 
\rj\ < 2.5 and muon candidates to \rj\ < 2.4; 

• A</> mm > 4.0, where the At^n variable is described in Section [I] 

Electrons and muons are considered isolated if the scalar sum of the transverse momenta of 
charged hadrons, photons, and neutral hadrons surrounding the lepton within a cone of radius 
a/ (A^) 2 + (A</>) 2 = 0.3, divided by the lepton pj value itself, is less than 0.20 for electrons and 
0.15 for muons. 

To identify b jets, we use the combined-secondary-vertex algorithm at the medium working 
point Il20ll . This algorithm combines information about secondary vertices, track impact pa- 
rameters, and jet kinematics, to separate b jets from light-flavored-quark, charm-quark, and 
gluon jets. To increase sensitivity to NP scenarios, which often predict soft b jets, we use all 
tagged b jets with pj > 30 GeV. The nominal b-jet-tagging efficiency is about 75% for jets with 
a pj value of 100 GeV, as determined from a sample of b-jet-enriched dijet events fl20| (for b 
jets with pj 30 GeV, this efficiency is about 60%). The corresponding misidentification rate 
is about 1.0%. We correct the simulated efficiencies for b-jet tagging and misidentification to 
match the efficiencies measured with control samples in the data. The b-tagging correction 
factor depends slightly on the jet pj and has a typical value of 0.95. The uncertainty on this 
correction factor varies from 0.03 to 0.07 for b jets with pj from 30 to 670 GeV, and is taken to 
be 0.13 for b jets with p T > 670 GeV. 

We define five signal regions, which partially overlap, to enhance sensitivity in different kine- 
matic regimes. The five regions correspond to different minimum requirements on Hj, £ miss , 
and the number of b jets. Hj is calculated using jets with pj > 50 GeV and \t]\ < 2.4. The five 
regions, denoted 1BL, 1BT, 2BL, 2BT, and 3B, are specified in Table [l] and were chosen without 
considering the data to avoid possible bias. The regions are selected based on expected signal 
and background event yields in simulation, to provide maximal sensitivity for discovery of the 
NP scenarios considered in this paper or, in the case of non-discovery, to best set limits on their 
parameters. Throughout this paper, we use the generic designation "SIG" to refer to any or all 
of these five signal regions. 

As examples illustrating the characteristics of events with at least one, two, or three tagged 
b jets, the £ miss distributions of events in the 1BL, 2BT, and 3B samples are shown in Fig. [2] 
The results are presented in comparison with Monte Carlo (MC) simulations of SM processes. 
Results from the benchmark Tlbbbb and Tltttt NP models mentioned in the Introduction are 
also shown. The simulated tt, W+jets, and Z+jets events are produced at the parton level with 
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Table 2: The number of data events and corresponding predictions from MC simulation for the 
signal regions, with normalization to 4.98 fb _1 . The uncertainties on the simulated results are 
statistical. 





1BL 


1BT 


2BL 


2BT 


3B 


Data 


478 


11 


146 


45 


22 


Total SM MC 


496 ±7 


13.3 ±0.6 


148 ±2 


36.8 ± 0.9 


15.0 ±0.2 


tt 


257 ±2 


3.6 ±0.2 


111 ± 1 


26.7 ±0.4 


12.6 ±0.2 


Single-top quark 


26.0 ± 1.0 


0.8 ±0.2 


9.1 ±0.5 


2.7 ±0.3 


0.88 ± 0.09 


W+jets 


80.0 ± 1.0 


2.8 ±0.2 


7.7 ±0.3 


2.2 ±0.2 


0.38 ± 0.05 


Z — > vv 


104 ±2 


5.3 ± 0.4 


13.8 ±0.7 


3.5 ±0.3 


0.80 ± 0.10 


Diboson 


1.8 ±0.1 


0.10 ±0.02 


0.27 ± 0.04 


0.05 ± 0.02 


0.02 ± 0.01 


QCD 


28.0 ± 6.0 


0.70 ± 0.20 


6.0 ±1.0 


1.7 ±0.6 


0.29 ± 0.07 



the MadGraph5. 1.1.0 |2T| event generator. Single-top-quark events are generated with the 
POWHEG 301 [22] program. The PYTHIA 6.4.22 program [23] is used to produce diboson and 
QCD events. For all simulated samples, PYTHIA 6.4 is used to describe parton showering and 
hadronization. All samples are generated using the CTEQ6 [24] parton distribution functions. 
The description of the detector response is implemented using the Geant4 [25 1 program. The 
tt sample is normalized to the measured cross section 1126 1 . The other simulated samples are 
normalized using the most accurate cross section calculations currently available. The jet en- 
ergy resolution in the simulation is corrected to account for a small discrepancy with respect 
to data |16|. In addition, the simulated samples are reweighted to describe the probability 
distribution observed in data for overlapping pp collisions within a bunch crossing ("pileup")- 
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Figure 2: The distributions of E™ 1SS for event samples selected with the (a) 1BL, (b) 2BT, and 
(c) 3B requirements, except for the requirement on E™ 1SS . The hatched bands show the statistical 
uncertainty on the total SM background prediction from simulation. The rightmost bin in all 
plots includes event overflow. The open histograms show the expectations for the Tlbbbb 
(solid line) and Tltttt (dashed line) NP models, both with m g = 925 GeV, m LSP = 100 GeV, and 
normalization to NLO+NLL. 

The numbers of events in the different signal regions are listed in Table [2] for data and simula- 
tion. The simulated results are for guidance only and are not used in the analysis. 



4 The A</> min variable 

Our method to evaluate the QCD background is based on the A^ m in variable. This method 
presumes that most E™ 1SS in a QCD event arises from the pj mismeasurement of a single jet. 

The A^min variable is a modified version of the commonly used quantity A^min = min(A</>,) 
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Figure 3: Illustration of variables used to calculate A</> m i n for the case of an event with exactly 
three jets with pj > 30 GeV. The light-shaded (light gray) solid arrows show the true pj values 
of the three jets i, j, and k. The dark-shaded (black) solid arrows show the reconstructed jet pj 
values. The angles of jets / and k with respect to the direction opposite to jet i are denoted ol; 
and The E™ 188 for the event is shown by the dotted (red) arrow. The component of E™ 88 
perpendicular to jet i, denoted T,-, is shown by the dotted (red) line. crj. is the uncertainty on T ; -. 
A(pj is the angle between E™ 88 and jet i. 

(i = 1, 2, 3), the minimum azimuthal opening angle between the E™ 88 vector and each of the 
three highest-pj jets in an event. Misreconstruction of a jet primarily affects the modulus of its 
transverse momentum but not its direction. Thus QCD background events are characterized 
by small values of A(p m ±n- The A(p m m variable is strongly correlated with E™ 188 , as discussed 
below. This correlation undermines its utility for the evaluation of the QCD background from 
data. To reduce this correlation, we divide the Acpj by their estimated resolutions a^,i to obtain 
A<£ min = min(A^/cr A ^). 

The resolution for jet i is evaluated by considering the pj resolution <t Pt of the other jets 
in the event. The uncertainty aj. on the component of the E™ 88 vector perpendicular to jet i 
is found using Oj. = Y^niP'pi^ sina. n ) 2 , where the sum is over all other jets in the event with 
pi > 30 GeV and oc n is the angle between jet n and the direction opposite jet i. The situation is 
depicted in Fig. [3] for an event with exactly three jets with pj > 30 GeV. Our estimate of the 
Acp resolution is cr^s,i = arctan^j-./E™ 188 ). (Note: arcsin(c>T ; /E™ 188 ) is technically more correct 
in this expression; we use arctan(c7 T ./E^ uss ) because it is computationally more robust while 
being equivalent for the small angles of interest here.) For the jet pj resolution, it suffices to use 
the simple linear parametrization o~ pj = 0.10 pj [fl6]|. 

Figure |4](a) shows the ratio of the number of events with A</> m i n > 0.3 to the number with 
A^min < 0.3 as a function of E™ 188 , for a simulated QCD sample selected with the 1BL require- 
ments except for those on A<^ m i n and Ej^ 1 ^ (A(p m i n > 0.3 or a similar criterion is commonly 
used to reject QCD background, see, e.g., Refs. [5-7]). The strong correlation between A^ m j n 
and Ej 1188 is evident. The corresponding result based on A</> m i n is shown in Fig. |4|b). For the 
latter figure we choose A^ m i n = 4.0 in place of A^ m i n = 0.3, which yields a similar selection 
efficiency. For values of Eip 188 greater than about 30 GeV, the distribution based on A^ m i n is 
seen to be far less dependent on E^ 1188 than that based on Acpm^. Figure |ijc) shows the result 
corresponding to Fig. |4pb) for events with zero tagged b jets. Comparing Figs. |4pb) and (c), it 
is seen that the ratio N(A^ m i n > 4.0)/N(A<£ m j n < 4.0) has an approximately constant value of 
about 0.13 (for E!j? 188 > 30 GeV) irrespective of the number of b jets. 

The measured results for N(A^ m j n > 4.0)/N(A^ m i n < 4.0) with zero b jets, for events with 
Hj > 400 GeV, 500 GeV, and 600 GeV, are shown in Fig. [5] By requiring that there not be a 
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Figure 4: QCD-simulation results: (a) the ratio of the number of events that pass the criterion 
A</>min > 0.3 (Npass) to the number that fail (Nf a ii) as a function of E™ ss , for events selected with 
the 1BL requirements except for those on A^min and £™ 1SS ; (b) The analogous ratio of events 
with A^ m in > 4.0 to those with A^ m in < 4.0; and (c) the same as (b) for events with zero b jets. 
The QCD-background estimate is based on the relative flatness of the distributions in (b) and 
(c) for E™ 1SS > 30 GeV, as illustrated schematically by the dashed lines. 



b jet, we reduce the contribution of top-quark events, which is helpful for the evaluation of 
QCD background (Section pTT) . The data in Fig. [5] are collected with a pre-scaled Hj trigger, 
allowing events to be selected at low E™ 1SS without a trigger bias. The data in Fig.|5|a) are seen 
to somewhat exceed the simulated predictions. The trend is visible in Fig.|5jb) to a lesser extent. 
This modest discrepancy arises because the A^ m in distribution is narrower in the simulation 
than in data. Since our method to evaluate the QCD background is based on the measured 
distribution, this feature of the simulation does not affect our analysis. The data in Fig. [5] are 
seen to exhibit the general behavior expected from the simulation. The region below around 
100 GeV is seen to be dominated by the QCD background. 
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Figure 5: The ratio N(A^ min > 4.0)/N(A<£ min < 4.0), denoted N p ass/Nf ai i, as a function of E^ ss 
for the zero-b-jet sample, for events selected with the basic event selection of the analysis except 
for the requirements on E™ ss and the number of b jets. The results are shown for (a) Hj > 
400 GeV, (b) Hj > 500 GeV, and (c) Hj > 600 GeV. The histograms show simulated predictions 
for the QCD and total SM background. 



5 Background evaluation 

In this section we describe our methods to evaluate the SM background from control samples 
in data. Each of the three main backgrounds - from QCD, Z+jets, and top-quark and W+jets 
events (where "top quark" includes both tt and single-top-quark events) - is evaluated sepa- 
rately. We group top quark and W+jets events together because they have a similar experimen- 



5.1 QCD background 
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tal signature. Note that our final results for the total SM background are derived from a global 
likelihood procedure that incorporates our background evaluation procedures into a single fit, 
described in Section |6] 

QCD background is evaluated using the A^ m in variable. Background from Z+jets events is 
evaluated by scaling the measured rates of Z — > £ + £~ (£ = e or y) events. To estimate the 
top-quark and W+jet background, we employ two complementary techniques. One, which 
we call the nominal method, is simple and almost entirely data based, while the other, which 
we call the £^ uss -reweighting method, combines results based on data with information from 
simulation to examine individual sources of top-quark and W+jets background in detail. 

5.1 QCD background 

The low level of correlation between At^n and E™ 1SS allows us to employ a simple method 
to evaluate the QCD background from data. As discussed in Section |I| the ratio N(A^ m i n > 
4.0)/N(A<£ m j n < 4.0) is approximately independent of E^ ass , and also of the number of b jets, 
for QCD events. Furthermore, the E™ ss distribution below 100 GeV is expected to be dominated 
by QCD events, especially for events with zero b jets (Fig. [5). We therefore measure N(A^ m i n > 
4.0)/N(A<£ m i n < 4.0) in a low E™ 1SS region of the zero-b-jet sample and assume this equals 
N(A<£ min > 4.0)/ N(A<£ min < 4.0) for QCD events at all £^ iss values, also for samples with b 
jets such as our signal samples. 
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Figure 6: Schematic diagram illustrating the regions used to evaluate the QCD background. 
The low sideband (LSB) and low sideband-low A^^ (LSB-LDP) regions correspond to 50 < 
£miss <- 100 GeV. The sideband (SB) and sideband-low A^^ (SB-LDP) regions correspond 
to 150 < E™ iss < 250 GeV. The signal (SIG) and signal-low A<£ min (SIG-LDP) regions have 
E™ ss ranges corresponding to those in Table [l] The designation "SIG" generically refers to 
any of the signal regions in this table. The SIG and SIG-LDP regions shown in the diagram 
explicitly depict the loose kinematic signal regions 1BL, 2BL, and 3B, which require E™ 1SS > 
250 GeV, but implicitly include the tight kinematic signal regions 1BT and 2BT, which require 
£miss y 500 GeV and 300 GeV, respectively. For each choice of signal region, the condition on 
Hj specified in Table [I] for that region is applied to all six panels of the diagram, while the 
condition on the number of b jets is applied to the four panels denoted "Standard selection." 
All regions with the low A^ m in (LDP) designation require 0.0 < A^ m j n < 4.0, while the other 
regions require A<£ m in > 4, 0. 

To perform this measurement, we divide the data into sideband and signal regions in the A^ m j n - 
£miss pj ane ^ as illustrated schematically in Fig. [6] We use the low-E^ 851 interval defined by 
50 < E^ iss < 100 GeV and A^ > 4.0. We call this interval the low sideband (LSB) region. We 
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5 Background evaluation 



Table 3: The relative systematic uncertainties (%) for the QCD background estimate in the 
signal regions. Because the 1BT QCD background estimate is zero (Section \5.5\ , we do not 
present results for 1BT in this table. 





1BL 


2BL 


2BT 


3B 


MC subtraction 


23 


43 


44 


24 


MC closure 


37 


41 


150 


45 


LSB reweighting 


7.9 


7.9 


9.8 


7.9 


Total 


44 


60 


160 


52 



also define low A</> min (LDP) intervals A<£ min < 4.0. We do this not only for the 50 < E™ 1SS < 
100 GeV region, but also for the signal regions (SIG) and for a sideband (SB) region defined by 
150 < £^ iss < 250 GeV. We denote these regions LSB-LDP, SIG-LDP, and SB-LDP, respectively. 
The LSB-LDP region is dominated by QCD events. Similarly the SB-LDP and SIG-LDP regions 
largely consist of QCD events, as illustrated for the 1BL, 2BT, and 3B SIG-LDP regions in Fig. [Zj 
For higher values of E^ 5 , contributions to the SB-LDP and SIG-LDP regions from events with 
a top quark or a W or Z boson become more important. This contamination is subtracted using 
simulation. 



CMS, is = 7 TeV, L 




CMS, is = 7 TeV, L = 4.98 fb 




CMS, is = 7 TeV, L = 4.98 fb -1 




A$ m . A$ m 

Figure 7: The distributions of A<£ m i n in data and simulation for events selected with the (a) 1BL, 
(b) 2BT, and (c) 3B requirements, except for the requirement on A<^ m i n . The simulated spectra 
are normalized as in Fig. [2] The hatched bands show the statistical uncertainty on the total 
SM prediction from simulation. The open histograms show the expectations for the Tlbbbb 
(solid line) and Tltttt (dashed line) NP models, both with m g = 925 GeV, m LSP = 100 GeV, and 
normalization to NLO+NLL. The SIG-LDP regions correspond to A<^ m i n < 4.0 and the signal 
(SIG) regions to A<p m m > 4.0. 

Applying corrections for the non-QCD components of the SIG-LDP and SB-LDP regions, our 
estimates of the QCD yields in the SIG and SB regions are therefore: 

N LS B 



QCD 
SIG 



Nlsb-ldp 

N LS b 
Nlsb-ldp 



x (^SIG-LDP 
X (Nsb-LDP " 



w to P/ MC 
JV SIG-LDP 

M top,MC 
JV SB-LDP 



N, 



W&ZMC\ 



SIG-LDP^ 



M W&Z,MC 
JV SB-LDP 



(1) 

(2) 



where the LSB and LSB-LDP results are derived from the zero-b-j et, p re-scaled Hj trigger sam- 
ple mentioned in Sectionkl The result for N^ CD is used in Section 5.3 The ratio Nlsb/ Nlsb-ldp 



is found to depend on the number of primary vertices (PV) in the event and thus on the LHC 
instantaneous luminosity. Before evaluating Eqs. ^ and |2jl, we therefore reweight the events 
in the pre-scaled sample to have the same PV distribution as the standard sample. 

Systematic uncertainties are summarized in Table [3] The systematic uncertainty associated 
with the subtraction of events with either a top quark or a W or Z boson from the SIG-LDP and 



5.2 Z+jets background 
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SB-LDP regions is determined by varying the subtracted values by their uncertainties, evalu- 
ated as described in Section [7| The systematic uncertainty associated with the assumption that 
E™ 1SS and A^mjn are uncorrelated is evaluated with an MC closure test, namely by determining 
the ability of the method to predict the correct yield using simulated samples. We compute 
(Mrue — Npred) / N pre d, where N pre d is the predicted number of QCD events in the signal region, 
estimated by applying the above procedure to simulated samples treated as data, and Nt rue is 
the true number. We assign the result, added in quadrature with its statistical uncertainty, as a 
symmetric systematic uncertainty. This uncertainty is dominated by statistical uncertainties for 
Nt m e- The closure test is performed both for the standard simulated samples and for simulated 
samples that are reweighted to account for discrepancies in the jet multiplicity distributions be- 
tween data and simulation; we take the larger closure discrepancy as the uncertainty. A third 
systematic uncertainty is evaluated by taking ±100% of the shift in the result caused by the PV 
reweighting of N lsb /N LS b-ldp- 

As a cross-check, we vary the definition of the LSB by raising and lowering its lower edge by 
10 GeV, which alters the number of events in the LSB by more than a factor of two in each case. 
The observed change in the QCD background estimate is negligible. 

5.2 Z+jets background 

Events with a Z boson and one or more b jets present an irreducible background when the Z 
decays to two neutrinos. We evaluate this background by reconstructing Z — > £ + £~ events 
(£ = e or }i) and removing the £ + and £~. Fits are performed to determine the Z — > £ + £~ yields, 
which are then corrected for background and efficiency. The efficiency is e = A ■ e tr ig ■ c\ KC0 • 
e Lei' wnere the geometrical acceptance A is determined from simulation while the trigger e^g, 
lepton reconstruction e/ reco , and lepton selection e^ se i efficiencies are determined from data. 
The corrected Z — > £ + £~ yields are used to estimate the Z — > vv background through scaling 
by the ratio of branching fractions, BR(Z -> i/v)/BR(Z -> £+£-) = 5.95 ± 0.02 (23, after 
accounting for the larger acceptance of Z —¥ vv events. 

The Z — > £ + £~ yields are small or zero in the signal regions. To increase these yields, we 
select events with the signal-sample requirements except with a significantly looser b-tagging 
definition. A scale factor derived from a control sample in data is then applied to estimate the 
number of Z — > £ + £~ events in the signal regions. The control sample is defined with the same 
loosened b-tagging definition, but without requiring the presence of a Z boson, and also by 
reversing the A</> mrn requirement, i.e., we require A^ m i n < 4.0, which yields a control sample 
with a b-jet content similar to that in the Z — > £ + £~ and Z — > vv events. All other selection 
criteria are the same as for the corresponding signal sample. The scale factors are given by the 
fraction of events in the control sample that passes the nominal b-tagging requirements. The 
scale factors have values around 0.30, 0.07, and 0.01 for the samples with > 1, > 2, and > 3 
b jets, respectively. We verify that the output of the b-tagging algorithm is independent of the 
presence of a Z. 

We validate our method with a consistency test, applying the above procedure to data samples 
with loosened restrictions on Hj and E™ 1SS . We find the number of predicted and observed 
Z — > £ + £~ events to be in close agreement. 

Systematic uncertainties are summarized in Table [4] We evaluate a systematic uncertainty on 
the scale factors by loosening and tightening the b-tagging criterion of the control sample and 
taking half the difference between the two results as an uncertainty. The size of the control 
sample changes by about ±30% in these variations. In addition, we use A</> mrn > 4.0 rather 
than A^ m i n < 4.0 to define the control sample and calculate the difference with respect to the 
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5 Background evaluation 



Table 4: The relative systematic uncertainties (%) for the Z — > 
signal regions, determined for Z — > e + e~ (Z — > f/ + /0 events. 



i/v background estimate in the 





1BL 


1BT 


2BL 


2BT 


3B 


Scale factors 


17 (20) 


17 (20) 


49 (61) 


49 (61) 


140 (110) 


Non-resonant background 


10 (8) 


10 (8) 


10 (8) 


10 (8) 


10 (8) 


Acceptance 


3(3) 


6(8) 


3(3) 


4(4) 


3(3) 


Lepton selection efficiency 


5(4) 


5(4) 


5(4) 


5(4) 


5(4) 


Trigger efficiency 


5(5) 


5(5) 


5(5) 


5(5) 


5(5) 


MC closure 


11 (19) 


11 (19) 


11 (19) 


11 (19) 


11 (19) 


Total 


24 (30) 


25 (30) 


52 (65) 


52 (65) 


150 (110) 



nominal results. Finally we evaluate the percentage difference between the number of pre- 
dicted and observed events found with the consistency test described above. The three terms 
are added in quadrature to define the systematic uncertainty of the scale factors. We evaluate a 
systematic uncertainty associated with the non-resonant £ + £~ background to Z — > £ + £~ events 
by comparing the fraction of fitted events in the Z — > £ + £~ peak from the nominal fit with 
those found using either a loosened Hj or a loosened Eij? iss restriction. The RMS of the three 
results is added in quadrature with the statistical uncertainty from the nominal fit to define the 
systematic uncertainty. The 1BL selection is used to determine this uncertainty for all signal 
regions. A systematic uncertainty for the acceptance is defined by recalculating the acceptance 
after varying the px and tj ranges of the £ + and £~ . The largest difference with respect to the 
nominal result is added in quadrature with the statistical uncertainty of the acceptance. A sys- 
tematic uncertainty is defined for the lepton selection efficiency and analogously for the trigger 
efficiency by recalculating the respective efficiency after varying the requirements on Hj, E™ 1SS , 
A<£min/ the number of jets, and the number of b jets (the number of jets is found using all jets 
with px > 50GeV and \r/\ < 2.4). We also use alternative signal and background shapes in 
the fits used to extract the Z — > t + £~ event yields. The maximum variations from each case 
are added in quadrature with the statistical uncertainty from the nominal method to define the 
systematic uncertainties. Finally, we evaluate a systematic uncertainty based on an MC closure 



test in the manner described in Section 5.1 We use the SB region to determine this uncertainty. 



An analogous procedure to that described above is used to evaluate the number of Z — > vv 
events Ngg^ 1 '" in the SB regions (150 < E™ 1SS < 250 GeV), along with the corresponding uncer- 
tainty. 



5.3 Top-quark and W+jets background (nominal) 

For most signal regions, tt events are expected to be the dominant background (Table [2]). Back- 
grounds from single-top-quark and W+jets events are expected to be smaller but to have a 
similar signature. Almost all top-quark and W+jets background in our analysis arises either 
because a W boson decays leptonically to an e or a p, with the e or p unidentified, not isolated, 
or outside the acceptance of the analysis, or because a W boson decays to a hadronically de- 
caying t lepton. We find empirically from studies with simulation that the shape of the E™ ss 
distribution is very similar for all top-quark and W+jets background categories that enter the 
signal (Table [l]) or sideband (150 < E™ 1SS < 250 GeV) regions, regardless of whether the W bo- 
son decays to e, p, or t, or whether a t lepton decays hadronically or leptonically. We also find 
that this shape is well-modeled by the E™ 1SS distribution of a single-lepton (SL) control sample 
formed by inverting the lepton veto, i.e., by requiring that exactly one e or one p be present 
using the lepton identification criteria of Section |3j in a sample whose selection is otherwise 
the same as the corresponding signal sample, except to reduce the potential contribution of 
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NP to the SL samples, we impose an additional restriction Mj < 100 GeV on the SL samples 
(only), where Mj is the transverse W-boson mass formed from the charged lepton and E™ ss 
momentum vectors. As an illustration, Fig. [H] shows a comparison based on simulation of the 
E™ 1SS distributions in the signal and SL samples, for events selected with the 1BL, 2BT, and 3B 
criteria. 

The E™ 1SS distributions of events in the SL samples with the 1BL, 2BT, and 3B requirements are 
shown in Fig. [9] The distributions are seen to be overwhelmingly composed of tt events. The 
expected contribution of the Tlbbbb NP scenario is found to be negligible, while that of the 
Tltttt scenario is small (compare with Fig. |2j|. 




Figure 8: The distributions of E™ ss in simulated events selected with the (a) 1BL, (b) 2BT, and 
(c) 3B requirements, except for the requirement on Eij? iss . The square (triangle) symbols show 
the results for signal (single-lepton SL control) sample events. The small plots below the main 
figures show the ratio of the signal to SL sample curves. The event samples include tf, W+jet, 
and single-top-quark events. 
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Figure 9: The distributions of E™ 1SS for the SL control sample for events selected with the 
(a) 1BL, (b) 2BT, and (c) 3B requirements, except for the requirement on E™ 1SS . The simulated 
spectra are normalized as in Fig. [2] The hatched bands show the statistical uncertainty on the 
total SM prediction from simulation. The open dashed histogram shows the expectations for 
the Tltttt NP model with m g = 925 GeV, m LSP = 100 GeV, and normalization to NLO+NLL 
(the corresponding contributions from the Tlbbbb model are negligible and are not shown). 

Based on these observations, we implement a template method in which the shape of the E™ ss 
distribution in an SL sample is used to describe the shape of the E™ ss distribution in the cor- 
responding signal sample of Table [lj for all top-quark and W+jets categories. We split each 
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Table 5: The relative systematic uncertainties (%) for the nominal top-quark and W+jets back- 
ground estimate in the signal regions. 





1BL 


1BT 


2BL 


2BT 


3B 


MC closure 


4.6 


15 


5.4 


4.6 


2.8 


Subtraction of QCD 


13 


19 


8.2 


20 


8.0 


Subtraction of Z — > vv 


3.4 


3.9 


5.4 


5.9 


15 


MC subtraction 


0.6 


0.6 


0.2 


0.4 


0.1 


Total 


14 


25 


11 


21 


18 



SL sample into a sideband E™ 1SS region SB-SL defined by 150 < £™ ss < 250 GeV, and a signal 
£nuss re gj on SIG-SL given by the corresponding E™ 1SS requirement in Table [lj The templates are 
normalized based on the number of top-quark plus W+jets events observed in the SB regions 
(150 < E™ 1SS < 250 GeV) of samples selected with the requirements of Table [l] except for that 
on E™ 1SS . A schematic diagram of the different regions used to evaluate the top and W+jets 



background with the nominal method is presented in Fig. 10 Contributions to the SB region 
from QCD and Z — > vv events are taken from the data-based estimates of Sections 5T and 52 
Small, residual contributions from other backgrounds such as diboson events are subtracted 
using simulation. 
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Figure 10: Schematic diagram illustrating the regions used to evaluate the top and W+jets back- 
ground with the nominal method. The sideband (SB) and signal (SIG) regions are described in 
the caption to Fig. [6] The sideband-single-lepton (SB-SL) and signal-single-lepton (SIG-SL) re- 
gions correspond to the SB and SIG regions, respectively, except an electron or muon is required 
to be present and a requirement is placed on the transverse W boson mass Mj < 100 GeV. 



Our estimate of the top-quark and W+jets background in the SIG region is therefore: 

top+W _ NsiG-SL 



N 



SIG 



SL 
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SB 



other,MC 
SB 



)• 



(3) 



Systematic uncertainties are summarized in Table [5] We consider the systematic uncertainty 
associated with MC closure, evaluated as described in Section |5.1| The closure is evaluated 
separately for the nominal combined top-quark and W+jets simulated sample, with the W+jets 



5.4 Top-quark and W+jets background (Ey lss -reweighting) 13 



cross section increased by 50% and the single-top-quark cross section by 100%, and with the 
W+jets cross section decreased by 50% and the single-top-quark cross section by 100%. We take 
the largest closure discrepancy as the uncertainty. We also consider the systematic uncertainty 
associated with subtraction of the QCD- and Z — > vv-background estimates in the SB region, 
evaluated by varying these estimates by their uncertainties. The systematic uncertainty asso- 
ciated with other backgrounds is evaluated by varying the MC -based background estimates in 
the SB region by their uncertainties, which we assume to be ±100% for these small terms. 

5.4 Top-quark and W+jets background (£^ lss -reweighting) 

We perform a second, complementary evaluation of the top-quark and W+jets background, 
which we refer to as the E™ iss -reweighting method. The E™ 188 distribution is determined sepa- 
rately for each of the three principal top-quark and W+jets background categories: 



1. top-quark or W+jets events in which exactly one W boson decays into an e or ]i, or into a 
T that decays into an e or p, while the other W boson (if any) decays hadronically; 

2. top-quark or W+jets events in which exactly one W boson decays into a hadronically 
decaying r, while the other W boson (if any) decays hadronically; 

3. tf events in which both W bosons decay into an e, p or t, with the T decaying either 
leptonically or hadronically. 



For the 1BL selection, these three categories represent, respectively, approximately 44%, 49%, 
and 7% of the total expected background from top-quark and W+jets events, as determined 
from simulation. 



5.4.1 Single e or ^ events: category 1 

Category 1 top-quark and W+jets background is evaluated with the SL data control sample 



introduced in Section 5.3 To relate event yields in the SL and SIG samples, we use constraints 
derived from knowledge of the W-boson polarization. The polarization of the W boson gov- 
erns the angular distribution of leptons in the W boson rest frame. Because forward-going lep- 
tons are boosted to higher momentum, and backward-going leptons to lower momentum, the 
W-boson polarization is directly related to the lepton momentum spectrum in the laboratory 
frame. W-boson polarization is predicted to high precision in the SM, with calculations carried 
out to the next-to-next-to-leading order for tf events [28J and to NLO for W+jets events p9|. 
The results of these calculations are consistent with measurements I30 - 33H. 



To construct a distribution sensitive to the W-boson polarization in W — > Iv (£ = e, p.) events 
(we include W — > rv — > Ivvv events in this category), we calculate the angle A9j between 
the direction of the W boson in the laboratory frame and the direction of the e or ji in the W 
boson rest frame, all defined in the transverse plane. The pj of the W boson is given by the 
vector sum of the E™ 18S and charged lepton pj vectors. When A8j is small, the charged lepton 
is produced along the pj direction of the W boson, typically resulting in a high-px charged 
lepton and a low-px neutrino (and therefore low E™ 188 ) in the laboratory frame. Such events 
usually appear in the SL sample. Conversely, when A6j is large, the charged lepton (neutrino) 
has lower (higher) pj, typically leading to larger E™ 188 , a charged lepton that fails our e or p 
identification criteria, and an event that appears as background in the signal samples. 



Figure 11 shows the distribution of A6j in data and simulation for SL events selected with the 



1BL, 2BT, and 3B criteria, except a looser E™ 88 requirement (E™ 188 > 250 GeV) is used for the 2BT 
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Figure 11: The distributions of A8j for events with a single e or p for the (a) 1BL, (b) 2BT, 
and (c) 3B selection criteria except with a loosened E™ ss restriction for (b) as described in the 
text. The stacked, filled histograms show simulated predictions for events in the SL sample. 
The dashed histogram shows the corresponding simulated prediction in the limit of perfect 
charged lepton reconstruction. The simulated results are normalized as in Fig. |2j 



Table 6: The relative systematic uncertainties (%) for the E™ iss -reweighting estimate of the top- 
quark and W+jets background, for category 1 (category 2) events. 





1BL 


1BT 


2BL 


2BT 


3B 


tr(W -4 £v)/<r(tt) ratio 


0.1 (0.7) 


3.3 (3.9) 


0.1 (0.3) 


0.2 (0.3) 


0.6 (0.3) 


Lepton efficiency 


2.0 (2.0) 


2.0 (2.9) 


2.0 (2.0) 


2.0 (2.2) 


2.0 (2.0) 


Top-quark pj spectrum 


0.1 (2.2) 


6.8 (0.7) 


0.6 (3.2) 


1.6 (0.7) 


1.6 (2.7) 


Jet energy scale 


1.6 (3.0) 


5.0 (5.2) 


1.7 (2.1) 


1.2 (4.9) 


1.1 (4.1) 


Jet energy resolution 


0.2 (0) 


0.4 (1.6) 


0.2 (0.2) 


0.5 (0.4) 


0.3 (0.2) 


b-tagging efficiency 


0.2 (0.4) 


1.0 (2.8) 


0.4 (0.6) 


0.5 (0.5) 


0.3 (0.4) 


MC closure 


10 (4.7) 


55 (29) 


12 (5.1) 


17(16) 


21 (6.6) 


T visible energy 


-(1.5) 


-(3.1) 


-(1.9) 


-(2.0) 


-(2.1) 


Total 


10 (6.5) 


56 (30) 


12 (7.0) 


17(17) 


21 (8.7) 



region to reduce statistical fluctuations. These results can be compared to those expected in the 



limit of perfect charged lepton reconstruction, indicated by the dashed histograms in Fig. 11 
which show the corresponding simulated predictions, including simulation of the detector, for 
top-quark and W+jets events with a single W — > Iv decay, where the e or yi needs only to be 
present at the generator level. The difference between the dashed histogram and the sum of the 
histograms with exactly one true e or y. found in the event represents W — > iv events in which 
the e or yi is either not reconstructed or does not meet the selection criteria of Section |3j 

To estimate the E™ 1SS distribution of category 1 events, we measure the E™ 1SS distribution of 
SL events in bins of Adj. The E™ 1SS distribution for each bin is then multiplied by an MC 
scale factor, determined as follows. The numerator equals the difference between the total 
yield from single-lepton processes (the dashed histograms in Fig. [TTJ and the subset of those 
events that enter the SL sample, both determined for that bin. The denominator equals the 
corresponding number of events that appear in the SL sample from all sources. The definition 
of the denominator therefore corresponds to the SL observable in data. The normalization 
of the E™ 1SS distribution in each A8j bin is thus given by the corresponding measured yield, 
corrected by a scale factor that accounts for the e or yi acceptance and reconstruction efficiency. 
The corrected E™ 1SS spectra from the different Adj bins are summed to provide the total E™ ss 
distribution for category 1 events. 

Systematic uncertainties are summarized in Table |6j To evaluate a systematic uncertainty asso- 
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ciated with the relative tt and W+jets cross sections, we vary the W+jets cross section by ±50%. 
From studies of Z — > £ + £~ events, the systematic uncertainty associated with the lepton re- 
construction efficiency is determined to be 2%. A systematic uncertainty associated with the 
top-quark pj spectrum is evaluated by varying the W-boson pj distribution in the simulated tf 
sample. In these variations, the number of events in the upper 10% of the distribution changes 
by two standard deviations of the corresponding result in data. The systematic uncertainties 
associated with the jet energy scale, jet energy resolution, and b-tagging efficiency are evalu- 
ated as described in Section [7] A systematic uncertainty to account for MC closure is evaluated 
as described in Section IB31 

5.4.2 t — ► hadrons: category 2 

Category 2 top-quark and W+jets background is evaluated using a single-muon data control 
sample. The muon in the event is replaced with a simulated hadronically decaying t (a T jet) 
of the same momentum. To account for the addition of the T jet, the initial selection criteria are 
less restrictive than those of the nominal analysis. We require two or more jets, E™ ss > 100 GeV, 
and do not place restrictions on Hj or A<£> m i n . To ensure compatibility with the triggers used to 
define this single-muon control sample, the minimum muon pj is set to 25 GeV, and the muon 
isolation requirement is also more stringent than the nominal criterion of Section |3j 

The visible energy fraction of the T jet, namely its visible energy divided by its pj value, is 
determined by sampling pj-dependent MC distributions ("response templates") of the x visible 
energy distribution, for a given underlying value of T pj. The T jet visible energy is added to 
the event. The modified event is then subjected to our standard signal region selection criteria. 
A normalization factor derived from simulation accounts for the relative rates of category 2 
and single-muon control sample events. 

The same systematic uncertainties are considered as for category 1 events. In addition, we 
evaluate an uncertainty for the T jet visible energy by varying the T energy scale by ±3% H34X 
Systematic uncertainties are summarized in Table |6j 

5.4.3 tt dilepton events: category 3 

The contribution of category 3 top-quark and W+jets background events is determined using 
dilepton data control samples. When both leptons are electrons or both are muons, or when 
one is an electron and the other a muon (where the e or p can either be from a W boson or 
t decay), we use simulated predictions to describe the shape of the E™ 1SS distribution. The 
normalization is derived from data, by measuring the number of dilepton events that satisfy 
loosened selection criteria for each class of events (ee, pp, or ep) individually. The measured 
value is multiplied by an MC scale factor, defined by the number of corresponding tt dilepton 
events that satisfy the final selection criteria divided by the number that satisfy the loosened 
criteria. 

When one or both of the leptons is a hadronically decaying t, we apply a procedure similar 
to that described for category 2 events. Data control samples of ep+jets and pp+jets events are 



selected with the loosened criteria of Section 5.4.2 One or both muons is replaced by a r-jet 



using MC response templates. The signal sample selection criteria are applied to the modified 
events, and the resulting E™ 1SS distributions normalized by scaling the number of events in the 
respective control samples with factors derived from MC simulation. 

The Elf 1 ' 3 ' 3 distributions of all six dilepton categories are summed to provide the total category 3 
prediction. A systematic uncertainty is evaluated based on MC closure in the manner described 
in Section IBTTl 
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6 Likelihood analysis 



Table 7: The SM background estimates from the procedures of Sections 5.1 5.4 in comparison 
with the observed number of events in data. The first uncertainties are statistical and the second 
systematic. For the total SM estimates, we give the results based both on the nominal and E™ ss - 
reweighting methods to evaluate the top-quark and W+jets background. 





IBL 


IBT 


2BL 




2BT 


3B 


QCD 


28 ± 3 ± 12 


0.0 ± 0.2 ± 0.3 


4.7 ±1.3 ±2.8 


0.8 


±0.4 ±1.2 


1.0 ± 0.5 ± 0.5 


Z — >■ vv 


154 ± 20 ± 32 


2.4 ±1.9 ±0.5 


32 ± 5 ± 20 


6.2 ±2.0 ±3.9 


4.7 ±1.3 ±6.5 


top quark & W+jets: 
nominal 


337 ±30 ±63 


6.5 ±3.3 ±1.8 


123 ±17 ±19 


22.! 


i ±6.9 ±5.5 


8.8 ±4.0 ±1.8 


gmiss -reweighting 


295 ± 16 ± 17 


4.0 ±1.2 ±1.5 


116±8±8 


19.! 


I ±2.5 ±2.2 


13.6 ±3.2 ±1.2 


Total SM: 














nominal 


519 ± 36 ± 72 


8.9 ±3.8 ±1.9 


159 ± 18 ± 28 


29.! 


* ± 7.2 ± 6.8 


14.4 ±4.2 ±6.8 


gmiss -reweighting 


477 ±26 ±38 


6.4 ± 2.3 ± 1.6 


153 ± 10 ± 22 


26.! 


1 ± 3.2 ± 4.6 


19.3 ±3.5 ±6.6 


Data 


478 


11 


146 




45 


22 



5.5 Summary of the data-based background estimates 

A summary of t he b ackground estimates is given in Table [7| The results from the three cate- 
gories of Section 



5.4 



are summed to provide the total E™ 1SS -reweighting top-quark and W+jets 
prediction. The estimates from the E™ iss -reweighting method are seen to be consistent with 
those from the nominal method and to yield smaller uncertainties. Note that there are statisti- 
cal correlations between the nominal and E™ iss -reweighting methods because they both make 
use of the SIG-SL region of Fig. 10 However, the nominal method relies on the SB and SB-SL 
regions of Fig. 
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while the E™ ss -reweighting method does not. The E^ uss -reweighting method 
makes use of MC scale factors and data selected with lepton-based triggers (for category 2 and 3 
events), while the nominal method does not. Furthermore, the systematic uncertainties of the 
two methods are largely uncorrected (compare Tables [5] and [6). 

The data are generally in good agreement with the SM expectations. However, for 2BT, the data 
lie 1.1 and 2.2 standard deviations (c) above the predictions (including systematic uncertain- 
ties) for the nominal and E™ 1SS -reweighting methods, respectively. For 3B, the corresponding 
deviations are 1.2o~ and 0.7a. Since these deviations are not significant, we do not consider 
them further. 



As an illustration, Fig. 12 presents the background predictions in comparison to data for the 
IBL, 2BT, and 3B selections. These results are based on the nominal top-quark and W+jets 
background estimate. 



6 Likelihood analysis 

We perform a global likelihood fit that simultaneously determines the SM background and 
yield of a NP model, using the background estimation techniques of Section |5j The likeli- 
hood analysis allows us to treat the SM backgrounds in a more unified manner than is possible 
through the collection of individual results in Table [7[ Furthermore, it allows us to account for 
NP contributions to the control regions ("signal contamination"), as well as to the signal region, 
in a comprehensive and consistent manner. 

It is difficult to account for signal contamination using the E™ 1SS -reweighting method, in con- 
trast to the nominal method. Therefore, signal contamination is evaluated for the nominal 
method only. Of the two NP scenarios we consider, one of them, the Tltttt model, exhibits 
non-negligible contamination of the SL samples, while the other, the Tlbbbb model, does not. 
Since the Tlbbbb model does not exhibit significant signal contamination, we employ both the 
nominal- and E™ iss -reweighting-based likelihood fits for this model. For the Tltttt model, we 
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CMS, is = 7 TeV, L, = 4.98 fb" 1 CMS, is = 7 TeV, L = 4.98 fb" 1 CMS, is = 7 TeV, L = 4.98 fb _1 
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E™ iss [GeV] Ef s [GeV] ~ E™ ss [GeV] 

Figure 12: The data-based SM background predictions for E™ 1SS in the (a) 1BL, (b) 2BT, and 
(c) 3B signal regions in comparison to data. The top-quark and W+jets estimate is based on the 
nominal method. The hatched bands show the total uncertainty on the prediction, including 
systematic uncertainties. The uncertainties are correlated between bins. The open histograms 
show the expectations for the Tlbbbb (solid line) and Tltttt (dashed line) NP models, both with 
ms = 925 GeV, m LSP = 100 GeV, and normalization to NLO+NLL. 



Table 8: The observables (number of data events) of the likelihood analysis for the nominal 
method, representing the signal region and ten control regions. The seven observables listed in 
the upper portion of the table are subject to contributions from the signal model in our analysis. 
The E™ 1SS SB region corresponds to 150 < E™ 1SS < 250 GeV while the SIG regions correspond to 
the E™ 1SS regions listed in Table nl The low A</> m i n region corresponds to A<p m [ n < 4.0. 



SIG 
SB 

SIG-LDP 

SB-LDP 

SIG-SL 

SB-Se 

SB-S^ 



SIG-ee 
SB-ee 
SIG-/^ 
SB-}ipi 



Standard selection, E™ 1SS SIG region 
Standard selection, E™ 1SS SB region 
Standard selection, E™ 1SS SIG /low A<£> m i n region 
Standard selection, E™ 1SS SB/low A</> m i n region 
Single-lepton selection, E™ ss SIG region 
Single-electron selection, E™ 1SS SB region 
Single-muon selection, £™ 1SS SB region 



Z —7 e + e~ selection, E™ 1SS SIG region 
Z — > e + e~ selection, E^ 1SS SB region 
Z — » }i + pi~ selection, E™ 1SS SIG region 
Z — > }i + pi~ selection, E^ 1SS SB region 



employ only the likelihood fit based on the nominal method. 

For the nominal method, the data are divided into 11 mutually exclusive bins, corresponding 
to the 11 observables listed in Table |HJ where each "observable" corresponds to the number of 
data events recorded for that bin. Note that the SB-SL events of Fig. [10] are divided into two 
components, one for electrons (denoted SB-Se) and the other for muons (denoted SB-Sf/), be- 
cause their trigger efficiencies and uncertainties differ. Similarly, the reconstruction efficiencies 
of Z — > e + e~ and Z — > events differ, so we divide the Z — > £ + £~ events of Section 5.2 

according to the lepton flavor. We further divide the Z — > £ + £~ events according to whether 
they appear in the sideband (150 < E2p iss < 250 GeV) or signal regions (Table [lj) of E™ 1SS . The 
four Z —7 £ + £~ samples are denoted SIG-ee and SlG-fiji for events in the signal regions, and 
SB-ee and SB-fifi for events in the sideband region. 

The likelihood model provides a prediction for the mean expected value of each observable in 
terms of the parameters of the signal and background components. The likelihood function 
is the product of 11 Poisson probability density functions, one for each observable, /3 distribu- 
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7 Limits on the Tlbbbb and Tltttt models 



Table 9: The relative systematic uncertainties (%) for the signal efficiency of the Tlbbbb SMS 
model with niz = 925 GeV and ?«lsp = 100 GeV. 





1BL 


1BT 


2BL 


2BT 


3B 


Tpt ptiptp'v scalp 


2.1 


11 


2.1 


3.5 


1.9 


Unclustered energy 


0.2 


0.8 


0.2 


0.2 


0.2 


Jet energy resolution 


1.0 


2.0 


1.0 


1.0 


1.0 


Pileup 


1.0 


1.0 


1.0 


1.0 


1.0 


b-jet tagging efficiency 


0.8 


0.9 


3.8 


3.9 


9.0 


Trigger efficiency 


3.6 


3.6 


3.6 


3.6 


3.6 


Parton distribution functions 


0.4 


1.6 


0.4 


0.7 


0.5 


Anomalous E™ 1SS 


1.0 


1.0 


1.0 


1.0 


1.0 


Lepton veto 


3.0 


3.0 


3.0 


3.0 


3.0 


Luminosity 


2.2 


2.2 


2.2 


2.2 


2.2 


Total uncertainty 


5.9 


12 


7.0 


7.6 


11 



tions 1 35 1 that parametrize efficiencies and acceptances, and f> distributions [ 35 1 that account 
for systematic uncertainties and uncertainties on external parameters. (External parameters 
include such quantities as the acceptance A and scale factors between the samples with loose 



and nominal b-tagging requirements discussed in Section 5.2 ) The new physics scenarios con- 
sidered here can contribute significantly to the seven observables listed in the upper portion 
of Table [H] In our model, the relative contributions of NP to these seven observables are taken 
from the NP model under consideration. The NP yield in the SIG bin is a free parameter. The 
NP contributions to the other six bins thus depend on the NP yield in the SIG bin. 

Analogous procedures are used to define the likelihood function for the E™ iss -reweighting 
method, with simplifications since there is no SB region in this case. 

The likelihood function is used to set limits on NP models. Upper limits at 95% confidence 
level (CL) are evaluated taking into account the effects of variation of the external parameters 
and their correlations. All upper limits are determined using a modified frequentist technique 

(cu) mw\- 



7 Limits on the Tlbbbb and Tltttt models 

Simulated Tlbbbb and Tltttt event samples are generated for a range of gluino and LSP masses 
using PYTHIA, with Mlsp < wig. For increased efficiency when performing scans over the 
SMS parameter space (see below), we base simulation of the CMS detector response on the 
fast simulation program [38], accounting for modest differences observed with respect to the 
GEANT4 simulation. 

Systematic uncertainties on signal efficiency are summarized in Table |9j using the Tlbbbb 
benchmark model as an example. A systematic uncertainty associated with the jet energy scale 
is evaluated by varying this scale by its pj- and f/-dependent uncertainties. A systematic uncer- 
tainty associated with unclustered energy is evaluated by varying the transverse energy in an 
event that is not clustered into a physics object by ±10%. The systematic uncertainties associ- 
ated with the correction to the jet energy resolution, the pileup reweighting method mentioned 
in Section |3j the b-jet tagging efficiency scale factor, and the trigger efficiency, are evaluated 
by varying the respective quantities by their uncertainties. The uncertainty for the trigger ef- 
ficiency includes a 2.5% uncertainty for the plateau efficiency. Systematic uncertainties asso- 
ciated with the parton distribution functions are evaluated following the recommendations of 
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Ref. I|39| . The systematic uncertainty associated with anomalous E™ 1SS values, caused by beam 
background and reconstruction effects, is 1%. The systematic uncertainty associated with the 
lepton veto is determined from studies of Z — > £ + £~ events in data to be 3.0%. The uncertainty 
in the luminosity determination is 2.2% |40| . 

We determine 95% CL upper limits on the SMS cross sections as a function of the gluino and 
LSP masses. Using the NLO+NLL cross section as a reference, we also evaluate 95% CL ex- 
clusion curves. The jet energy scale, unclustered energy, parton distribution function, and b-jet 
tagging efficiency uncertainties are evaluated for each scan point. Other uncertainties are fixed 
to the values in Table [9] For each choice of gluino and LSP mass, we use the combination of the 
top-quark and W+jets background estimation method, and the signal selection (Table [T]), that 
provides the best expected limit. We do not include results for points close to the mg = m^sp 
diagonal because of neglected uncertainties from initial-state radiation (ISR), which are large 
in this region. Specifically, we remove from consideration any point for which the signal effi- 
ciency changes by more than 50% when the PYTHIA ISR calculation is turned off (for the default 
simulation, the ISR calculation is turned on). 



For the Tlbbbb model, the E™ iss -reweighting method is always found to provide the best 
expected result: we therefore use this method to determine the Tlbbbb limits. The E™ 1SS - 
reweighting method incorporates an additional constraint compared to the nominal method, 
namely the normalization of the SM prediction for the E™ 1SS distribution from the SIG-SL sam- 
ple (Fig. 10 1, and not merely the E™ 1SS distribution shape. As a consequence, it has greater 
discrimination power against NP scenarios. 
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Figure 13: (a) The 95% CL observed cross section upper limits (UL) for the Tlbbbb SMS model, 
based on the E™ ss -reweighting method to evaluate the top-quark and W+jets background. For 
each point, the selection that provides the best expected cross section limit is used. The solid 
contour shows the 95% CL exclusion limits on the gluino and LSP masses using the NLO+NLL 
cross section for new physics. The dashed contours represent the theory uncertainties, (b) The 
corresponding expected limits. The dashed contours represent the uncertainties on the SM 
background estimates. 



The results for Tlbbbb are shown in Fig. 13 'a). The 1BT selection is found to provide the best 
expected result in the bottom right corner of the distribution, corresponding to the region of 
large gluino-LSP mass splitting. The 2BT selection is best for the swath roughly parallel to the 
diagonal defined by gluino masses between around 650 and 900 GeV along the bottom edge 
of the plot. The 3B selection is generally best elsewhere. The solid contour shows the 95% CL 
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8 Summary 




gluino mass [GeV] gluino mass [GeV] 

Figure 14: (a) The 95% CL observed cross section upper limits (UL) for the Tltttt SMS model, 
based on the nominal method to evaluate the top-quark and W+jets background. For each 
point, the selection that provides the best expected cross section limit is used (the best selection 
is virtually always 3B). The solid contour shows the 95% CL exclusion limits on the gluino and 
LSP masses using the NLO+NLL cross section for new physics. The dashed contours represent 
the theory uncertainties, (b) The corresponding expected limits. The dashed contours represent 
the uncertainties on the SM background estimates. 



exclusion curve for the reference cross section. The zigzagging structure around mg = 900 GeV, 
wzlsp = 450 GeV is due to the transition from the region where 3B is the best expected selection 
to that where 2BT is best, in conjunction with the slight excess observed in data for the 2BT 
selection in comparison with the SM prediction for the E™ iss -reweighting method (Section 5.5 i. 



The dashed contours represent the results when the reference cross section is varied by the 
theory uncertainty fl"2|. Our results improve those of Refs. [6, 7] for large LSP mass values. 
For example, for gluino masses around 800 GeV, we extend the exclusion of the reference cross 
section from an LSP mass of about 400 GeV [7] to about 500 GeV, where these numerical values 
are given by the observed results minus the one standard deviation theory uncertainties. 

Fig. [l3|(b) shows the best expected results for the Tlbbbb model. In this case the dashed con- 
tours represent the results when the SM background estimates (Table [7|) are varied by their 
uncertainties. 



The corresponding results for the Tltttt model are presented in Fig. 14 Our Tltttt results are 
based on the nominal top-quark and W+jets background estimation method for the reason 
stated in Section [6] In this case, the best expected selection is essentially always 3B. Note that 
the observed limits for Tltttt, shown in Fig [l4ja), are not as stringent as the expected limits, 
shown in Fig[l4^b), because of the slight excess of data events in the 3B sample for the nominal 
method, compared to the SM expectation (Table [7). 



8 Summary 

In this paper, we present a search for an anomalous rate of events with three or more jets, 
at least one, two, or three tagged bottom-quark jets, no identified, isolated leptons, and large 
missing transverse energy E™ ss . The study is based on a sample of proton-proton collision 
data collected at y/s = 7 TeV with the CMS detector at the LHC during 2011, corresponding to 
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an integrated luminosity of 4.98 fb~ . The principal standard model backgrounds, arising from 
top-quark, W+jets, Z+jets, and QCD-multijet events, are evaluated from data. We introduce a 
variable A^ m i n that allows us to address the QCD-multijet background with a simple approach. 
The top-quark and W+jets background is evaluated with two complementary methods, which 
yield consistent results. In the E™ iss -reweighting method to evaluate the top-quark and W+jets 
background, we introduce a technique based on the W polarization in tt and W+jets events. 
Our analysis is performed in a likelihood framework in order to account for backgrounds, and 
for new physics contamination of the control regions, in a unified and consistent manner. 

We find no evidence for a significant excess of events beyond the expectations of the standard 
model and set limits on new physics in the context of the b-jet-rich Tlbbbb and Tltttt simplified 
model spectra, in which new strongly interacting particles decay to two b-quark jets, or two t- 
quark jets, plus an undetected particle. For the Tlbbbb scenario, our results are substantially 
more stringent than those of Ref. [6] and improve on those of Ref. [7] for large LSP masses. 
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